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Abstract 

Formal models for gene assembly in ciliates have been developed, in 
particular the string pointer reduction system (SPRS) and the graph 
pointer reduction system (GPRS). The reduction graph is a valuable tool 
within the SPRS, revealing much information about how gene assembly 
is performed for a given gene. The GPRS is more abstract than the SPRS 
and not all information present in the SPRS is retained in the GPRS. As 
a consequence the reduction graph cannot be defined for the GPRS in 
general, but we show that it can be defined (in an equivalent manner as 
defined for the SPRS) if we restrict ourselves to so-called realistic overlap 
graphs. Fortunately, only these graphs correspond to genes occurring in 
nature. Defining the reduction graph within the GPRS allows one to carry 
over several results within the SPRS that rely on the reduction graph. 



1 Introduction 

Gene assembly is a biological process that takes place in a large group of one- 
cellular organisms called ciliates. The process transforms one nucleus, called 
the micronucleus, through a large number of splicing operations into another 
nucleus, called the macronucleus. The macronucleus is very different from the 
micronucleus, both functionally and in terms of differences in DNA. Each gene 
occurring in the micronucleus in transformed into a corresponding gene in 
the macronucleus. Two models that are used to formalize this process are the 
string pointer reduction system (SPRS) and the graph pointer reduction system 
(GPRS). The former consist of three types of string rewriting rules operating 
on strings, called legal strings, while the latter consist of three types of graph 
rewriting rules operating on graphs, called overlap graphs. The GPRS can be 
seen as an abstraction of the SPRS, however it is not fully equivalent with the 
SPRS: some information present in the SPRS is lost in the GPRS. 

Legal strings represent genes in their micronuclear form. The reduction graph, 
which is defined for legal strings, is a notion that describes the corresponding 
gene in its macronuclear form (along with its waste products). Moreover, it 
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has been shown that the reduction graph retains much information on which 
string negative rules (one of the three types of string rewriting rules) can be 
or are used in this transformation O [H E] • Therefore it is natural to define an 
equivalent notion for the GPRS. However, as we will show, since the GPRS loses 
some information concerning the application of string negative rules, there is no 
unique reduction graph for a given overlap graph. We will show however, that 
when we restrict ourselves to 'realistic' overlap graph then there is a unique 
reduction graph corresponding to this graph. These overlap graphs are called 
realistic since non-realistic overlap graphs cannot correspond to (micronuclear) 
genes. Moreover, we explicitly define the notion of reduction graph for these 
overlap graphs (within the GPRS) and show the equivalence with the definition 
for legal strings (within the SPRS). Finally, we show some immediate results 
due to this equivalence, including an open problem formulated in Chapter 13 in 

In Section [2] we recall some basic notions and notation concerning sets, strings 
and graphs. In Section [3] we recall notions used in models for gene assembly, 
such as legal strings, realistic strings and overlap graphs. In Section 5] we recall 
the notion of reduction graph within the framework of SPRS and we prove 
a few elementary properties of this graph for legal strings. In particular we 
establish a calculus for the sets of overlapping pointers between vertices of the 
reduction graph. In Section [5] we prove properties of the reduction graph for 
a more restricted type of legal strings, the realistic strings. It is shown that 
reduction graphs of realistic strings have a subgraph of a specific structure, the 
root subgraph. Moreover the existence of the other edges in the reduction graph 
is shown to depend directly on the overlap graph, using the calculus derived in 
the SectionlH In Scction[6]we provide a convenient function for reduction graphs 
(but not only reduction graphs) which simplifies reduction graphs without losing 
any information. In Section [7] we define the reduction graph for realistic overlap 
graphs, and prove the main theorem of this paper: the equivalence of reduction 
graphs defined for realistic strings and reduction graphs defined for realistic 
overlap graphs. In Section [8] we show immediate consequences of this theorem. 



2 Notation and Terminology 

In this section we recall some basic notions concerning functions, strings, and 
graphs. We do this mainly to set up the basic notation and terminology for this 
paper. 

The cardinality of set X is denoted by \X\. The symmetric difference of sets X 
and Y, {X\Y) U {Y\X), is denoted hy X (BY. Being an associative operator, 
we can define the symmetric difference of a family of sets {Xi)iizA and denote 
it by ^^^^Xi. The composition of functions f : X ^ Y and g : Y ^ Z is 
the function gf : X ^ Z such that {gf){x) = g{f{x)) for every x E X. The 
restriction of / to a subset A of X is denoted by f\A. 

We will use A to denote the empty string. For strings u and v, we say that t; is a 
substring oj uiiu = wivw2, for some strings wi, W2', we also say that v occurs in 
u. Also, w is a cyclic substring of u if either v is a, substring of u or u = v^wvi and 
V = V1V2 for some strings vi,V2, w. We say that v is a conjugate ofu\iu = W1W2 
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and V = W2W1 for some strings Wi and W2- For a string u = x\Xi ■■ - Xn over S 
with G S for all i S {1, . . . , n}, wc say that v = .t„.t„_i • • • .xi is the reversal 
of u. A homomorphism is a function : S* ^ A* such that (p{uv) = (p{u)(p{v) 
for all u,v 

We move now to graphs. A labelled graph is a 4-tuple 

G = {V,E,f,r), 

where y is a finite set, E C {{x, y} \ x,y & V,x ^ y}, and f iV ^T. 

The elements of V are called vertices and the elements of E are called edges. 
Function / is the labelling function and the elements of F are the labels. We say 
that G is discrete ii E = 0. Labelled graph G' = {V, E', f\V', F) is a subgraph 
ofGifV CV and E' CEv'=Er\ {{x, y} \ x,y e V ,x y}. If E' = Ey, we 
say that G' is the subgraph of G induced by V. 

A string tt — eie2 ■ ■ ■ e„ (z E* with n > 1 is a path in G if there is a wif 2 • • • fn+i G 
y* such that Ci = {vi,Vi+i} for all 1 < i < n. Labelled graph G is connected 
if there is a path between every two vertices of G. A subgraph H of G induced 
by Vh is a component of G if both is connected and for every edge e € E we 
have either e C Vh or e C y\Vff. 

As usual, labelled graphs G = {V,E,f,r) and G' = (y',^',/',F) are iso- 
morphic, denoted by G « G', if there is a bijection a : V ^ V such that 
/(u) = f'{a{v)) for t; e y, and 

{x,y}eEiS {a{x),a{y)} &E' 

for x,y gV. Bijection a is then called an isomorphism from G to G' . 

In this paper we will consider graphs with two sets of edges. Therefore, we need 
the notion of 2-edge coloured graphs. A 2-edge coloured graph is a 5-tuple 

G = {V,EuE2,f,r), 

where both {V, E\,f, T) and (V, E2, /, F) are labelled graphs. 

The basic notions and notation for labelled graphs carry over to 2-edge coloured 

graphs. However, for the notion of isomorphism care must be taken that the 
two sorts of edges are preserved. Thus, if G = {y,Ei,E2, f,T) and G' = 
iy' ,E[,E'2, f ,T') are 2-edge coloured graphs, then it must hold that for an 
isomorphism a from G to G', 

{x,y)&Ei m{a{x),a{y))eE[ 

for x,y &V and i G {1, 2}. 

3 Gene Assembly in Ciliates 

Two models that are used to formalize the process of gene assembly in ciliates 
are the string pointer reduction system (SPRS) and the graph pointer reduction 
system (GPRS). The SPRS consist of three types of string rewriting rules oper- 
ating on legal strings while the GPRS consist of three types of graph rewriting 
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rules operating on overlap graphs. For the purpose of this paper it is not nec- 
essary to recall the string and graph rewriting rules; a complete description of 
SPRS and GPRS, as well as a proof of their "weak" equivalence, can be found 
in [3]. We do recall the notions of legal string and overlap graph, and we also 
recall the notion of realistic string. 

We fix K > 2, and define the alphabet A = {2, 3, . . . , k}. For C A, we define 
D = {a \ a e D} and Ud ^ D U D; also 11 = IIa. The elements of 11 will be 
called pointers. We use the "bar operator" to move from A to A and back from 

A A = I p if p e A 

A to A. Hence, for p G 11, p = p. For p e 11, we define p = s _ - , i.e., 

yp if p e A 

p is the "unbarred" variant of p. 

For a string u = xiX2---Xn with Xi Cz li [1 < i < n), the complement of 
u is X1X2 ■ ■ ■ Xn. The inverse of u, denoted by u, is the complement of the 
reversal of w, thus u — XnXn-i ■ ■ '^i- The domain of u, denoted by dom(u), is 
{p I p occurs in v}. We say that u is a legal string if for each p G dom{u), u 
contains exactly two occurrences from {p,p}. 

We define the alphabet e„ = {Mi, Mi | 1 < i < k}. We say that 5 G 9* is 
a micronuclear arrangement if for each i with \ < i < n, 5 contains exactly 
one occurrence from {M^, M^}. With each string over 0^, we associate a unique 
string over 11 through the homomorphism tt^ : O* — > H* defined by: 

7r«(Mi) = 2, 7r«,(M„) = K, it^{M,) ^ i{i + \) for 1 < i < k. 



and Tin{Mj) — ^/^{Mj) for 1 < j < k. We say that string u is a realistic string 
if there is a micronuclear arrangement 5 such that u = 7r„ [5) . We then say that 
J is a micronuclear arrangement for u. 

Note that every realistic string is a legal string. However, not every legal string 
is a realistic string. For example, a realistic string cannot have "gaps" (missing 
pointers): thus 2244 is not realistic while it is legal. It is also easy to produce 
examples of legal strings which do not have gaps but still are not realistic — 
3322 is such an example. Realistic strings are most useful for the gene assembly 
models, since only these legal strings can correspond to genes in ciliates. 

For a pointer p and a legal string u, if both p and p occur in u then we say that 
both p and p are positive in u; if on the other hand only p or only p occurs in 
u, then both p and p are negative in u. So, every pointer occurring in a legal 
string is either positive or negative in it. Therefore, we can define a partition 
of dom(u) = pos(m) U neg(u), where pos(u) = {p G dom(u) | p is positive in u] 
and neg(u) = {p G dom(M) | p is negative in u}. 

Let u — X1X2 • • • x„ be a legal string with Xi £ II for 1 < i < n. For a pointer 
p G H such that {xi,Xj} C {p,p} and 1 < i < j < n, the p-interval of u is 
the substring XiXi+i • ■ ■ Xj. Substrings Xi-^ ■ ■ ■ Xj^ and Xi^ ■ ■ ■ Xj^ overlap in u if 
ii < 12 < ji < j2 or 12 < ii < j2 < ji- Two distinct pointers p, g G H overlap 
in u if the p-interval of u overlaps with the g-interval of u. Thus, two distinct 
pointers p, q G H overlap in u iff there is exactly one occurrence from {p, p} in 
the g-interval, or equivalently, there is exactly one occurrence from {q, q} in the 
p-interval of u. Also, for p G dom(w), we denote 

Ou{p) = {q G dom(u) | p and q overlap in u}. 
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Figure 1: The overlap graph of legal string u — 24535423. 

and for < i < j < n, we denote by Ou{i,j) the set of all p £ dom(7i) such that 
there is exactly one occurrence from {p,p} in ■ ■ 'Xj. Also, we define 

O«0, i) = Ou{i,j)- Intuitively, 0„(i, j) is the set of p G dom(u) for which the the 
substring between "positions" i and j in u contains exactly one representative 
from {p,p}, where position i for < z < n means the "space" between Xi and 
Xi+i in u. For i = it is the "space" on the left of xi, and ior i = n it is the 
"space" on the right of a;„. A few elementary properties of 0„(«, j) follow. We 
have Ou{i,n) = 0^(0, i) for i with < i < n. Moreover, for z, j. A; G {0, . . . ,71}, 
Ouihj) ® k) ~ 0„(i, fc); this is obvious when i < j < k, but it is valid in 
general. Also, for < i < j < n, Ou{i,j) = iff Xi+i ■ ■ ■ xj is a legal string. 

Definition 1 

Let u be a legal string. The overlap graph of u, denoted by 7„, is the labelled 
graph 

{dom{u),E, CT, {+,-}), 

where 

E — {{p, q} \ p,q dom(u),p ^ q, and p and q overlap in u}, 
and a is defined by: 




+ if p e pos(u) 
— if p G neg(M) 



for all p G dom(M) . h 
Example 1 

Let u = 24535423 be a legal string. The overlap graph of u is 

7 = ({2, 3, 4, 5}, {{2, 3}, {4, 3}, {5, 3}}, a, {+, -}), 
where (t(w) = — for all vertices w of 7. The overlap graph is depicted in Figure [T] 

Let 7 be an overlap graph. Similar to legal strings, we define dom(7) as the set 
of vertices of 7, pos(7) = {p G dom(7) | a{p) — +}, neg(7) = {p G dom(7) | 
cr(p) = — } and for q G dom(M), Oj{q) = {p G dom(7) | {p, q} £ E}. 

An overlap graph 7 is realistic if it is the overlap graph of a realistic string. Not 
every overlap graph of a legal string is realistic. For example, it can be shown 
that the overlap graph 7 of u = 24535423 depicted in Figure [1] is not realistic. 
In fact, one can show that it is not even realizable — there is no isomorphism 
a such that a (7) is realistic. 
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4 The Reduction Graph 



We now recall the (full) reduction graph, which was first introduced in j^. 
Remark 

Below we present this graph in a slightly modified form: we omit the special 
vertices s and t, called the source vertex and target vertex respectively, which 
did appear in the definition presented in [5]. As shown in Section[51 in this way a 
realistic overlap graph corresponds to exactly one reduction graph. Fortunately, 
several results concerning reduction graphs do not rely on the special vertices, 
and therefore carry over trivially to reduction graphs as defined here. 

Definition 2 

Let u = P1P2 ■ ■ - Pn with pi, . . . ,p„ G 11 be a legal string. The reduction graph 
of u, denoted by TZu, is a 2-edge coloured graph 

(]/,i^i,i?2,/,dom(M)), 

where 

l/={/l,/2,...,/„} u 

El = {ei, 62, . . . , e„} with = {/-, h+i} for 1 < i < n - f , e„ = {I'„, h}, 

E2 - {{/;, {U, I'j] M, j e {f , 2, . . . , n} with i ^ J and p, = Pj} U 

I hj e {1,2,. and p., and 

/(/,) = /(ZO = p, for 1 < z < n. 

■ 

The edges of Ei are called the reality edges, and the edges of E2 are called the 
desire edges. Intuitively, the "space" between pi and pi+i corresponds to the 
reality edge = Hence, we say that i is the position of e^, denoted 

by posn(ei), for all i e {1,2, ... ,n}. Note that positions are only defined for 
reality edges. Since for every vertex v there is a unique reality edge e such that 
V G e, we also define the position of v, denoted by posn(D), as the position of e. 
Thus, posn(/j') = posn(/i_|_i) — i (while posn(/i) — n). 

Example 2 

Let u = 324324 be a legal string. Since 432 can not be a substring of a realistic 
string, u is not realistic. The reduction graph TZu of u is depicted in Figured) 
The labels of the vertices are also shown in this figure. Note the desire edges 
corresponding to positive pointers (here 2 and 4) cross (in the figure), while 
those for negative pointers are parallel. Since the exact identity of the vertices 
in a reduction graph is not essential for the problems considered in this paper, 
in order to simplify the pictorial representation of reduction graphs we will 
omit this in the figures. We will also depict reality edges as "double edges" to 
distinguish them from the desire edges. Figure [3] shows the reduction graph in 
this simplified representation. 
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Figure 2: The reduction graph of u of Example [21 

2^=4 2^=3 3^=4 

2^=4 2^=3 3^=4 

Figure 3: The reduction graph of u of Example [2] in the simplified representation. 

3 3 6 6 2 2 

7 7 5 5 4 4 

2 2 3 3 4 4 

7 7 6 6 5 5 

Figure 4: The reduction graph of u of Example [31 
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Example 3 

Let u = TTriMjMiMeMsMsM^Mi) = 726734563245. Thus, unlike the previous 
example, u is a realistic string. The reduction graph is given in Figure SI As 
usual, the vertices are represented by their labels. 

The reduction graph is defined for legal strings. In this paper, we will show 
how to directly construct the reduction graph of realistic string u from only the 
overlap graph of m. In this way we can define the reduction graph for realistic 
overlap graphs in a direct way. 

Next we consider sets of overlapping pointers corresponding to pairs of vertices 
in reduction graphs, and start to develop a calculus for these sets that will later 
enable us to characterize the existence of certain edges in the reduction graph, 
cf. Theorem [T5] 

Example 4 

We again consider the legal string u = 324324 and its reduction graph TZu from 
Example [21 Desire edge e — {^21 -^5} is connected to reality edges ei = {l2,h} 
and 62 — {/gj/e} with positions 2 and 5 respectively. We have 0„(2,5) = 
{2,3,4}. Also, reality edges {I[,l2} and {l2:h} have positions 1 and 2 respec- 
tively. We have 0„(1, 2) = {2}. 

Lemma 3 

Let M be a legal string. Let e = {vi,V2} be a desire edge of TZu and let p be the 
label of both vi and V2. Then 



0„(posn(wi),posn(i;2)) = 




if p is negative in u, 
© {p} if p is positive in u. 



Proof 

Let u = P1P2 ■ ■ - Pn with pi , p2 , ■ • ■ , Pn G II and let i and j be such that i < j and 
p — Pi ~ Pj. Without loss of generality, we can assume posn(ui) < posn(u2). 
Then, vi G {li, I-} and V2 £ {/j, /j}, hence posn(wi) G {« — 1, i} and posn(w2) £ 
{J-1,J}- 

First, assume that p is negative in u. By the definition of reduction graph, the 
following two cases are possible: 

1. e = thus 0„(posn(/i),posn(/j)) = 0„(j - 1, j) = 0„(p), 

2. e ^ thus 0„(posn(/,),posn(/;-_i)) = 0„(i, j - 1) = 0„(p), 

Thus in both cases we have 0„(posn(ui), posn(w2)) = Ou{p)- 

Finally, assume that p is positive in u. By the definition of reduction graph, the 
following two cases are possible: 

1. e = thus 0„(posn(/i),posn(/,)) = 0„(i-l, = 0„(p)©{p}, 

2. e = {/^/j}, thus 0„(posn(/;),posn(/j)) = 0„(^,J) = Ou{p) © M, 

Thus in both cases we have 0„(ii, 12) — Ou{p) ® {p}- ■ 
The following result follows by iteratively applying the previous lemma. 
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Corollary 4 

Let u be a legal string. Let 



Pa = Pi Pi = P2 P2 = ■■ = Pn Pn = Pn+1 

be a subgraph of TZu, where (as usual) the vertices in the figure are represented 
by their labels, and let ei (e2, resp.) be the leftmost (rightmost, resp.) edge. 
Note that ei and 62 are reality edges and therefore posn(ei) and posn(e2) are 
defined. Then 0„(posn(ei), posn(e2)) = (pos(m) n P) ® {®t^p Ou{t)) with P = 
{pi,...,Pn}. 

By the definition of the reduction graph the following lemma holds. 
Lemma 5 

Let w be a legal string. If and I'^ are vertices of TZu, then Ou(posn(/i), posn(/j')) = 
{p}, where p is the label of li and I^. 

Example 5 

We again consider the legal string u and desire edge e as in the previous example. 
Since e has vertices labelled by positive pointer 2, by Lemma [3] we have (again) 
Ou{2, 5) = Oui2) ® {2} = {2, 3, 4}. Also, since I2 and with positions 1 and 2 
respectively are labelled by 2, by Lemma[5]we have (again) 0„(1, 2) = {2}. 

5 The Reduction Graph of Reahstic Strings 

The next theorem asserts that overlap graph 7 for realistic string u retains all 
information of TZu (up to isomorphism). In the next few sections, we will give 
a method to determine TZu (up to isomorphism), given 7. Of course, the naive 
method is to first determine a legal string u corresponding to 7 and then to 
determine the reduction graph of u. However, we present a method that is able 
to construct TZu in a direct way from 7. 

Theorem 6 

Let u and v be realistic strings. If 7„ = 71;, then TZu ~ TZy. 
Proof 

By Theorem 1 in [3] (or Theorem 10.2 in [4J), we have 7„ = 7^ iff w can be 
obtained from u by a composition of reversal, complement and conjugation 
operations. By the definition of reduction graph it is clear that the reduction 
graph is invariant under these operations (up to isomorphism). Thus, TZu ~ TZ^m 

The previous theorem is not true for legal strings in general — the next two 
examples illustrate that legal strings having the same overlap graph can have 
different reduction graphs. 

Example 6 

Let u — 2653562434 and v — h{u), where h is the homomorphism that inter- 
changes 5 and 6. Thus, v — 2563652434. Note that both u and v are not realistic, 
because substrings 535 of u and 636 of v can obviously not be substrings of real- 
istic strings. The overlap graph of u is depicted in Figure [5] From Figure [5] and 
the fact that v is obtained from u by renumbering 5 and 6, it follows that the 
overlap graphs of u and v are equal. The reduction graph TZu of u is depicted 
in Figure [HI The reduction graph TZy of v is obtained from TZu by renumbering 
the labels of the vertices according to h. Clearly, TZu 76 TZv. 
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Figure 5: The overlap graph of both legal strings u and v of Example [HI 



2 6 5 6 



2 6 5 6 



2 4 4 3 3 5 



2 4 4 3 3 5 

Figure 6: The reduction graph of u of Example [HI 



2 3 4 2 2 3 




Figure 7: The reduction graph of u of Example [T] 



2 4 2 3 3 4 




Figure 8: The reduction graph of v of Example [T] 
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Example 7 

Let u = 7rf,{MiM2M3Mi) = 223344 be a realistic string and let v = 234432 be 
a legal string. Note that v is not realistic. Legal strings u and v have the same 
overlap graph 7 (7 = ({2, 3, 4}, 0, a, {+, — }), where a{v) = — for u e {2, 3, 4}). 
The reduction graph TZu of u is depicted in Figure [3 and the reduction graph 
TZv of V is depicted in Figure [HI Note that TZu has a component consisting of six 
vertices, while TZy does not have such a component. Therefore, TZu 7^ TZv- 

For realistic strings the reduction graph has a special form. This is seen as fol- 
lows. For 1 < i < K the symbol Mi (or Mi) in the micronuclear arrangement 
defines two pointers pi and Pi+i (or pi+i and pi) in the corresponding realistic 
string u. At the same time the substring piPi^i (or pi^ipi, resp.) of u corre- 
sponding to Mi (or Mi, resp.) defines four vertices Ij , I'j, Ij+i, 1^^^ in TZu- It 
is easily verified (cf. Theorem |S] below) that the "middle" two vertices /j and 
Ij+i, labelled by pi and Pi+i respectively, are connected by a reality edge and /j 
{Ij+i, resp.) is connected by a desire edge to a "middle vertex" resulting from 
Mi-i or Mi^i (Mi+i or Mi+i, resp.). This leads to the following definition. 

Definition 7 

Let u be a legal string and let k = ldom(M)| + 1. If TZu contains a subgraph L 
of the following form: 

2 2^=3 3^=-^=K K 

where the vertices in the figure are represented by their labels, then we say that 
u is rooted. Subgraph L is called a root subgraph of TZu- ■ 

Example 8 

The realistic string u with dom(u) — {2, 3, . . . , 7} in Example[3]is rooted because 
the reduction graph of u, depicted in Figure [H contains the subgraph 

2 2^3 3^ •• ^7 7 

The next theorem shows that indeed every realistic string is rooted. 
Theorem 8 

Every realistic string is rooted. 
Proof 

Consider a micronuclear arrangement for a realistic string u. Let k — |dom(M) | -|- 
1. By the definition of tt^, there is a reality edge (corresponding to either 
TTK,{Mi) = i{i + 1) or TTK{Mi) = (i -t- 1) i) connecting a vertex labelled by i to a 
vertex labelled by i -I- 1 for each 2 < i < k. It suffices to prove that there is a 
desire edge connecting a to e^+i for each 2 < i < k — 1. This can easily be seen 
by checking the four cases where Ci corresponds to either TTK.{Mi) or TTK.{Mi) and 
Ci+i corresponds to either ^^^{Mi^i) or 7r„(Mi4.i). ■ 

In the remaining of this paper, we will denote |dom(u)| 4- 1 by k for rooted 
strings, when it is clear which rooted string u is meant. The reduction graph 
of a realistic string may have more than one root subgraph: it is easy to verify 
that realistic string 234 • ■ • k234 ■ • ■ k for k > 2 has two root subgraphs. 

Example [2] shows that not every rooted string is realistic. The remaining results 
that consider realistic strings also hold for rooted strings, since we will not be 
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using any properties of realistic string that are not true for rooted strings in 
general. 

For a given root subgraph L, it is convenient to uniquely identify every reahty 
edge containing a vertex of L. This is done through the following definition. 

Definition 9 

Let M be a rooted string and let L be a root subgraph of TZu- We define rsposi^ f. 
for 2 < fc < K as the position of the edge of L that has vertices labelled by k 
and k + 1. We define rsposj^ i {rsposi^^^, resp.) as the position of the edge of 
TZu not in L containing a vertex of L labelled by 2 (k, resp.). When k = 2, to 
ensure that rsposj^ -^ and rsposj^ ^ are well defined, we additionally require that 
rsposi^ -y < rspos^ ^. m 

Thus, rspoS]^ ^ (for 1 < fc < k) uniquely identifies every reality edge containing 
a vertex of L. If it is clear which root subgraph L is meant, we simply write 
rspoSf. instead of rsposj^ ^. for 1 < fc < k. 

The next lemma is essential to prove the main theorem of this paper. 
Lemma 10 

Let u be a rooted string. Let L be a root subgraph of Let i and j be positions 
of reality edges in TZu that are not edges of L. Then Ou{i,j) = iS i = j. 

Proof 

The reverse implication is trivially satisfied. We now prove the forward impli- 
cation. The reality edge (for 2 < A; < k) in i with vertices labelled by k 
and k + 1 corresponds to a cyclic substring Mfc e {piP2,P2Pi | Pi € {k, k},p2 G 
{k + l,k + 1}} of u. Let fci and ^2 with 2 < ki < k2 < k. li ki + 1 = k2, then 
and are connected by a desire edge (by the definition of L). Therefore, 
pointer fcg common in M/.^ and M^^ originates from two different occurrences 
in u. If on the other hand ki + 1 k2, then Mk-^ and Mk-j do not have a letter in 
common. Therefore, in both cases, M^^ and M^^ are disjoint cyclic substrings 
of u. Thus the Mk for 2 < fc < «; are pairwise disjoint cyclic substrings of u. 

Without loss of generality assume i < j. Let u = uiU2 ■ ■ - Un with Ui G 11. Since 
u is a legal string, every u; for 1 < ? < n is either part of a (with 2 < A; < k) 
or in {2, 2, n, R}. Consider u' = Ui+iUi+2 ■ ■ - Uj- Since i and j are positions of 
reality edges in TZu that are not edges of L, we have u' = Mk^Mk^ ■ ■ ■ for 
some distinct fci, A:2, • • • , fcm G {1, 2, . . . , «•}. where Mi E {2, 2} and G {k, k}. 

It suffices to prove that u' = A. Assume to the contrary that u' ^ A. Then there 
is a 1 < Z < k; such that Mi is a substring of u'. Because Ou{i,j) = 0, we know 
that u' is legal. If Z > 1. then Mi-i is also a substring of m', otherwise it' would 
not be a legal string. Similarly, ii I < k, then Mj+i is also a substring of u' . By 
iteration, we conclude that u' = u. Therefore, i = 0. This is a contradiction, 
since cannot be a position of a reality edge. Thus, u' = X. m 

Lemma 11 

Let u be a rooted string. Let i be a root subgraph of 7^„. If /, and I- are vertices 
of TZu, then exactly one of Ii and I- belongs to L. 

Proof 

By the definition of reduction graph, Ii and I^ have a common vertex label p 
but are not connected by a desire edge. Therefore, and I- do not both belong 
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to L. Now, if li and I'^ both do not belong to L, then the other vertices labelled 
by p, which are Ij and for some j, both belong to L - a contradiction by the 
previous argument. Therefore, either li or I[ belongs to L, and the other one 
does not belong to L. ■ 

The next result provides the main idea to determine the reduction graph given 
(only) the overlap graph as presented in Section [T] It relies heavily on the 
previous lemmas. 

Theorem 12 

Let M be a rooted string, let L be a root subgraph of 7?.„, and let p,q £ dom(M) 
with p < q. Then there is a reality edge e in TZu with both vertices not in i, 
one vertex labelled by p and the other labelled by q iff 

Ou{t) = (pos(w) n P) © {p} © {q}, 

where P = {p + 1, . . . ,q — 1} L) P' for some P' C {p, q}. 
Proof 

We first prove the forward implication. Let e = {vi,V2} with vi labelled by 
p, V2 labelled by q, and posn(e) = i. Thus e = {J^', We assume that 
vi = I[ and V2 — li+i, the other case is proved similarly. Let ii = posn(/i) 
and 12 = posn(/j'_j_j^). By Lemma [5] Oti(i, zi) — {p} and 0„(j2,j) — {<?}• By 
Lemma [TTl li (labelled by p) and (labelled by q) belong to L. Thus ii € 
{rvpoSp_i,rspoSp} and i2 G {r^pos^^i^r^pos^}. By applying Corollary[3]on L, we 
have Ou{ii,i2) = (pos(m) n P) © (©^^p 0„(i)) with P = {p+ 1, . . . ,q - 1} U P' 
for some P' C {p, q}. By definition of Ou{i,j) we have 

= Ou{i,i) = Ou{i,ii) © 0„(ii,?2) © 0„(«2,i) 

Thus the desired result follows. 

We now prove the reverse implication. By applying Corollary 3] on L, we have 
Ouihih) = (pos(w) n P) © (0tgpO„(i)) for some ii G {rspoSp_i,rspoSp} and 
*2 G {'^poSq-ii rspos^} (depending on P'). By LemmaO there is a vertex vi (u2, 
resp.) labelled by p {q, resp.) with position i (j, resp.) such that Ou{h ii) = {p} 
and Oit(«2,i) = {9}- By Lemma [Tl] these vertices are not in L. We have now 

= Ou{i,ii) © 0„ (11,^2) © Ou{i2,j) = 0„(i, j) 

By Lemma [TUl Ou{i,j) = implies that i = j. Thus, there is a reality edge 
{vi, V2} in TZu (with position i), such that vi is labelled by p and W2 is labelled 
by q and both are not vertices of L. m 

Let be the overlap graph of some legal string u. Clearly we have pos(u) = 
pos(7„) and for all p G dom(u) = dom(7„), 0„(p) — 0^^{p). Thus by Theo- 
rem [12] we can determine, given the overlap graph of a rooted string u, if there 
is a reality edge in TZu with both vertices outside L that connects a vertex la- 
belled by p to a vertex labelled by q. We will extend this result to completely 
determine the reduction graph given the overlap graph of a rooted string (or a 
realistic string in particular). 
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3 6 2 



7 5 4 



2 3 4 



7 6 5 

Figure 9: The labelled graph cps(7?.„), where TZu is defined in Example [HI The 
vertices in the figure are represented by their labels. 

6 Compressing the Reduction Graph 

In this section we define the cps function. The cps function simplifies reduction 

graphs by replacing the subgraph P P by a single vertex labelled by p. In 

this way, one can simplify reduction graphs without "losing information". We 
will define cps for a general family of graphs G which includes all reduction 
graphs. The formal definitions of Q and cps are given below. 

Let Q be the set of 2-edge coloured graphs G — (V, Ei, E2, /, F) with the prop- 
erty that for all {vi,V2} 6 £'2, it holds that /(wi) = f{v2)- Note that for a 
reduction graph 7^„, we have TZu G Q because both vertices of a desire edge 
have the same label. For aW G ^ Q, cps(G) is obtained from G by considering 
the second set of edges as vertices in the labelled graph. Thus, for the case 
when G is a reduction graph, the function cps "compresses" the desire edges to 
vertices. 
Definition 13 

The function cps from Q to the set of labelled graphs is defined as follows. Let 
G={V,Ei,E2j,T)&g, then 

cps(G) = (i?2,£;i,/',F) 

is a labelled graph, where 

E'l = {{61,62} C E2 I 3wi,D2 G : ui e 6i,W2 e 62,61 ^ 62 and {wi,W2} e Ei}, 
and for e G £'2: f'{&) = f{v) with v £ e. m 
Note that /' is well defined, because for all {vi, V2} € E2, it holds that /(wi) = 

Example 9 

We are again considering the realistic string u defined in Example [31 The re- 
duction graph of TZu is depicted in Figured] The labelled graph cps (??.„) is 
depicted in Figure[9l Since this graph has just one set of edges, the reality edges 
are depicted as 'single edges' instead of 'double edges' as we did for reduction 
graphs. 
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It is not hard to see that for reduction graphs TZu and TZy , we have TZu ~ TZv iff 
cps(7?.„) « cps(7^^). In this sense, function ops aUows one to simpUfy reduction 
graphs without losing information. 

7 From Overlap Graph to Reduction Graph 

Here we define reduction graphs for reahstic overlap graphs, inspired by the 
characterization of Theorem 1121 In the remaining part of this section we will 
show its equivalence with reduction graphs for realistic strings. 

Definition 14 

Let 7 = {Domy, E^, cr, {+, — }) be a realistic overlap graph and let k = |Dom^| + 
1. The reduction graph of ^, denoted by TZ-^, is a labelled graph 

= {V,EJ,Dom^), 

where 

V = {Jp,4\2<p< «}, 
fiJp) = fiJp) = P, for 2 < p < «, and 
e £ E iS one of the following conditions hold: 

1. e = { Jp, Jp+il and 2 < p < k. 

2. e = { Jp, Jq}, 2 < p < q < K, and 

0^{t) - (pos(7) n P) ® {p} © {q}, 

where P ~ {p + 1, . . . ,q — 1} IJ P' for some P' C {p, q}. 

3. e — { J2, Jp}, 2 < p < K, and 

0o^(t) = (pos(7)nP)®M, 

teP 

where P = {2, . . . ,p - 1} U P' for some P' C {p}. 

4. e = { J^, Jp}, 2 < p < K, and 

0o^(t) = (pos(7)nP)®M, 

teP 

where P = {p + 1, . . . , k} U P' for some P' C {p}. 

5. e — { J2, J^}, K > 3, and 

00^(0 =pos(7)nP, 
teP 

where P = {2, . . . , k}. ■ 
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Figure 10: The overlap graph 7 of a reahstic string (used in Example fTO]) . 




Figure 1 1 : The reduction graph TZ~f of the overlap graph 7 of Example 1101 The 
vertices in the figure are represented by their labels. 



Example 10 

The overlap graph 7 in Figure [Till is realistic. Indeed, for example realistic string 
u = TT7{MiM3M'rM5M2MiM6) = 453475623267 has this overlap graph. Clearly, 
the reduction graph TZ^ of 7 has the edges { J^, Jp+i} for 2 <p < 7. The follow- 
ing table lists the remaining edges of TZ-, . The table also states the characterizing 
conditions for each edge as stated in Definition 1141 Note that pos(7) — 0, and 
consequently the right-hand side of the defining equations in points 2, 3 and 4 
in Definition [HI are independent of the choice of P' . 

Witness 



Edge 


P 


{J2,Je} 


{3,4,5} 




{2,3,4,5,6} 


{J4, J7} 


{5,6} 


{^4, J7} 


{4,5,6,7} 


{^3,J5} 


{4} 


{^5,4} 


{6,7} 




{2} 



{2, 4, 5, 6, 7} ©{3, 5} ©{3, 4, 7} = {2, 6} 
{3} © {2, 4, 5, 6, 7} © {3, 5} © {3, 4, 7} © {3} = {2, 6} 

{3, 4, 7} ©{3} = {4, 7} 
{3, 5} © {3, 4, 7} © {3} © {3, 5} = {4, 7} 

{3, 5} = {3, 5} 
{3} ©{3, 5} -{5} 
{3} = {3} 

We have now completely determined TZ-y] it is shown in Figure [TTJ As we have 
done for reduction graphs of legal strings, in the figures, the vertices of reduction 
graphs of realistic overlap graphs are represented by their labels. 



Example 11 

In the second example we construct the reduction graph of an overlap graph that 
contains positive pointers. The overlap graph 7 in Figure [T^ is realistic. Indeed, 
for example realistic string u = Tr^iMrMiMeMsM^'M^Mi) = 726734563245 
introduced in Example [3] has this overlap graph. Again, the reduction graph TZ^ 
of 7 has the edges { J^, Jp+i} for 2 < p < 7. The remaining edges are listed in 
the table below. 
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Edge 


P 


Witness 


L "-^ ; 1 J 


14,5,6) 


{2, 3, 5, 6} © {2, 3, 4, 6} ® {3, 4, 5, 7} = {3, 7} 


{^3,^6} 


{3,4,5} 


{3} © {4, 5, 6} ® {2, 3, 5, 6} © {2, 3, 4, 6} = {3, 6} 




{2,3,4,5,6} 


{2} © {4, 5, 7} © {3} © {4, 5, 6} © {2, 3, 5, 6} 






©{2,3, 4, 6} ©{3, 4, 5, 7} = {2, 6} 


{J2, J4} 


{3,4} 


{3} ©{4, 5, 6} ©{2, 3, 5, 6} = {2, 4} 


{J4, J5} 


{4,5} 


{2, 3, 5, 6} ©{2, 3, 4, 6} = {4, 5} 


{.h,J7} 


{5,6,7} 


{2, 3, 4, 6} ©{3, 4, 5, 7} ©{2, 6} = {5, 7} 




{2,. ..,7} 


{2}©{4,5,7}©...©{2,6} = 



Again, we have now completely determined the reduction graph; it is shown in 
Figure m 



Figures El and [13] show, for u = 726734563245, that cps(7e„) « 7^^. The next 
theorem shows that this is true for every realistic string u. 

Theorem 15 

Let u be a realistic string. Then, cps(7?.„) w TZ-^^^. 
Proof 

Let K = |dom(u)| + 1, let 7 = 7„, let TL-^ — {V.y,E^, f-y,AoTa.{u)), let Ru — 
cps(7?.„) = (14, /„, dom(u)) and let L be a root subgraph of TZu- Recall that 
the elements of Vu are the desire edges of TZu ■ 

Let h -.Vu defined by 

, ^ \ Jf f ul if ^' is not an edge of L 
h[v) — < ' 

I Jf^i^y) if V is an edge of L 

We will show that h is an isomorphism from Ru to TZ-y. Since for every I S 
dom(u) there exists exactly one desire edge v of TZu that belongs to L with 
fu{v) = I and there exists exactly one desire edge v of TZu that does not belong 
to L with fu{v) = ^, it follows that h is one-to-one and onto. Also, it is clear 
from the definition of /..,, that = f-y{h{v)). Thus, it suffices to prove that 

{vi,V2) eEu^ {h{vi),h{v2)} e E^. 

We first prove the forward implication {wi,W2} G i?n => {h{vi) , h{v2)} £ E^. 
Let {vi,V2} e Eu, let p = /„(wi) and let q = fu{v2)- Clearly, vi ^ V2. By the 
definition of cps, there is a reality edge e = {v\,V2\ of TZu with v\ g v\ and 
V2 G V2 (and thus wi and V2 are labelled by p and g in respectively). Let i 
be the position of e. We consider four cases (remember that v\ and V2 are both 
desire edges of TZu)'- 

1. Assume that e belongs to L. Then clearly, v\ and i;2 are edges of L. 
Without loss of generality, we can assume that p < q. From the structure 
of root subgraph and the fact that e is a reality edge of TZu in L, it follows 
that q = p + 1. Now, h{vi) = Jp and h{v2) — J'g — J'p^i- By the first 
item of the definition of reduction graph of an overlap graph, it follows 
that {h{vi),h{v2)} ~ {J'piJ'p+i} ^ ^i- "^'^'^^ proves the first case. In the 
remaining cases, e does not belong to L. 

2. Assume that vi and V2 are both not edges of L (thus e does not belong to 
L). Now by Theorem W]\ and the second item of the definition of reduction 
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graph of an overlap graph, it foUows that {h{vi), ^(^2)} — {Jp, Jq} £ £'7- 
This proves the second case. 

3. Assume that either vi or V2 is an edge of L and that the other one is 
not an edge of L (thus e does not belong to L). We follow the same line 
of reasoning as we did in Theorem [T^l Without loss of generality, we can 
assume that vi is not an edge of L and that V2 is an edge of L. Clearly, 



for each position ii. By the structure of L we know that q — 2 or q — k. 
We prove it for the case q = 2 (q = k, resp.). By Lemma[5]and Lemma [TT| 
we can choose ii G {rspoSp_i,rspoSp} such that Ou{ii,i) — {p}- By ap- 
plying CoroUaryHon L, we have 0„(i, ii) = (pos(m) HP)® (0fgp 
with P = {2, . . . ,p - 1} U P' (P = {p + 1, . . . , k} U P', resp.) for some 
P' Q {p}- By the third (fourth, resp.) item of the definition of reduction 
graph of an overlap graph, it follows that {h{vi) , h{v2)} = {J2J Jq} £7 
{{h{vi) , h{v2)} = { J^j Jq} G -^71 resp.). This proves the third case. 

4. Assume that both vi and V2 are edges of L, but e does not belong to 
L. Again, we follow the same line of reasoning as we did in Theorem 1121 
Without loss of generality, we can assume that p < q. By the structure of 
L, we know that p = 2 and q = k > 3. By applying CoroUaryHon L, we 
have = Ouii, i) = (pos(m) n P) ® (0tgp 0«(i)) with P = {2, . . . , k}. 
By the fifth item of the definition of reduction graph of an overlap graph, 
it follows that {h{vi), h{v2)} = { J2, J'^} G E-y- This proves the last case. 

This proves the forward implication. We now prove the reverse implication 
{vi,V2} e £^7 ^ {h~^{vi),h~^{v2)} e Eu, where h~^, the inverse of h, is 
given by: 

h^^{Jp) is the unique w G with fu{v) = p that is not an edge of L, 
h^^{Jp) is the unique v €Vu with /„(w) = p that is an edge of L, 

for 2 < p < K. Let e E E^. We consider each of the five types of edges in the 
definition of reduction graph of an overlap graph. 

1. Assume e is of the first type. Then e = { J^, Jp+i} for some p with 2 < p < 
K. Since h^^{Jp) is the desire edge of L with both vertices labelled by p 
and h~^{Jp_^_i) is the desire edge of L with both vertices labelled by p+l, it 
follows, by the definition of root subgraph, that h^^{Jp) and h^^{Jp+i) are 
connected by a reality edge in L. Thus, we have {h^^{Jp),h~^{Jp_^_i)} e 
Eu. This proves the reverse implication when e is of the first type (in 
Definition [Til). 

2. Assume e is of the second type. Then e = {Jp,Jq} for some p and q with 
2 < p < q < K and 



\teP / 

with P = {p + 1, . . . , (J - 1} U P' for some P' C {p, q}. By Theorem [H 
there is a reality edge {wi, W2} in TZu, such that wi has label p and W2 has 



= Ou[i,i) = Ou{i,ii) ® Ou{ii,i) 
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Figure 14: The reduction graph of u of Example [T21 The vertices in the figure 
are represented by their labels. 

label q and both are not vertices of L. By the definition of cps, we have 
a {w'i,w'2\ G Eu such that fu{w'i) = p {fu{w2) = q, resp.) and w'-^ {w'2, 
resp.) is not an edge of L. Therefore w[ = h^^{Jp) and w'2 — h^^{Jq). 
This proves the reverse implication when e is of the second type. 

3. The last three cases are proved similarly. 

This proves the reverse implication and we have shown that h is an isomorphism 
from Ru to TZ^. m 

Example 12 

The realistic string u = 453475623267 was introduced in Example [TOl The 
reduction graph TZj of the overlap graph of u is given in Figure fTTI The reduction 
graph TZu of u is given in Figure [TH It is easy to see that the result of applying 
cps to TZu is a graph that is indeed isomorphic to TZ-y. This makes clear why 
there were two proofs for both edges { J2, Je} and { J4, J7} in Example [TUl each 
one corresponds to one reality edge in TZu (outside L). 

Formally, we have not yet (up to isomorphism) constructed the reduction graph 
TZu of a realistic string u from its overlap graph. We have 'only' constructed 
cps(7?.„) (up to isomorphism). However, it is clear that TZu can easily be obtained 
from ci>s{TZu) (up to isomorphism) by considering the edges as reality edges and 
replacing every vertex by a desire edge of the same label. 

8 Consequences 

Using the previous theorem and |5] (or Chapter 13 in [4]), we can now eas- 
ily characterize successfulness for realistic overlap graphs in any given S C 
{Gnr,Gpr,Gdr}. The notions of successful reduction, string negative rule and 
graph negative rule used in this section are defined in [J] . 

Below we restate a theorem of [3] . 
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Theorem 16 

Let N be the number of components in TZu- Then every successful reduction of 
u has exactly — 1 string negative rules. 

Due to the 'weak equivalence' of the string pointer reduction system and the 
graph pointer reduction system, proved in Chapter 11 of we can, using 
Theorem 1151 restate Theorem 1161 in terms of graph reduction rules. 

Theorem 17 

Let u be a realistic string, and N be the number of components in TZ^^. Then 
every successful reduction of 7„ has exactly — 1 graph negative rules. 

As an immediate consequence we have the following corollary. It provides a 
solution to an open problem formulated in Chapter 13 in [1]. 

Corollary 18 

Let w be a realistic string. Then is successful in {Gpr, Gdr} iff TZj^ is con- 
nected. 

Example 13 

Every successful reduction of the overlap graph of Example [10] has exactly two 
graph negative rules, because its reduction graph consist of exactly three com- 
ponents. For example gnr4 gdrg 7 gnr2 gdr^ e a successful reduction of this 
overlap graph. 

Every successful reduction of the overlap graph of Example [TT] has exactly one 
graph negative rule. For example gnrj gpr4 gprg gpry gprg gprg is a successful 
reduction of this overlap graph. 

With the help of [5] (or Chapter 13 in [4]) and Corollary [TSl we are ready to 
complete the characterization of successfulness for realistic overlap graphs in 
any given S C {Gnr, Gpr, Gdr}. 

Theorem 19 

Let u be a realistic string. Then 7,; is successful in: 

• {Gnr} iff 7„ is a discrete graph with only negative vertices. 

• {Gnr, Gpr} iff each component of 7„ that consists of more than one vertex 
contains a positive vertex. 

• {Gnr, Gdr} iff all vertices of 7„ are negative. 

• {Gnr, Gpr, Gdr}. 

• {Gdr} iff all vertices of 7,1 are negative and TZ-y^ is connected. 

• {Gpr} iff each component of 7„ contains a positive vertex and TZ-y^ is 
connected. 

• {Gpr, Gdr} iff TZ^^ is connected. 
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9 Discussion 



We have shown how to directly construct the reduction graph of a reahstic string 
u (up to isomorphism) from the overlap graph 7 of u. Prom a biological point of 
view, this allows one to reconstruct a representation of the macronuclcar gene 
(and its waste products) given only the overlap graph of the micronuclear gene. 
Moreover, this results allows one to (directly) determine the number n of graph 
negative rules that are necessary to reduce 7 successfully. Along with some 
results in previous papers, it also allows us to give a complete characterization 
of the successfulness of 7 in any given S C {Gnr, Gpr, Gdr}. 

It remains an open problem to find a (direct) method to determine this number 

n for overlap graphs 7 in general (not just for realistic overlap graphs). That 
is, a better method than first determining a legal string u corresponding with 7 
and then determining the reduction graph of u. 
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